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Abstract — We consider the regular balanced model of formula 
generation in conjunctive normal form (CNF) introduced by 
Boufkhad, Dubois, Interian, and Selman. We say that a formula is 
p-satisfying if there is a truth assignment satisfying 1 2 * + p2 * 
fraction of clauses. Using the first moment method we determine 
upper bound on the threshold clause density such that there 
are no p-satisfying assignments with high probability above this 
upper bound. There are two aspects in deriving the lower bound 
using the second moment method. The first aspect is, given any 
p e (0, 1) and k, evaluate the lower bound on the threshold. This 
evaluation is numerical in nature. The second aspect is to derive 
the lower bound as a function of p for large enough k. We address 
the first aspect and evaluate the lower bound on the p-satisfying 
threshold using the second moment method. We observe that as k 
increases the lower bound seems to converge to the asymptotically 
derived lower bound for uniform model of formula generation 
by Achlioptas, Naor, and Peres. 

I. Regular Formulas and Motivation 

A literal of a boolean variable is the variable itself or 
its negation. A clause is a disjunction (OR) of k literals. A 
formula is a conjunction (AND) of a finite set of clauses. A 
A;-SAT formula is a formula where each clause is a disjunction 
of k Uterals. A legal clause is one in which there are no 
repeated or complementary literals. Using the terminology of 
||5l, we say that a formula is simple if it consists of only legal 
clauses. A configuration formula is not necessarily legal. A 
satisfying (SAT) assignment of a formula is a truth assignment 
of variables for which the formula evaluates to true i.e. all the 
clauses evaluates to true. We denote the number of variables 
by «, the number of clauses by m, and the clause density i.e. 
the ratio of clauses to variables by a ^ j. We denote the 
binary entropy function by h{x) = — xln(x) — (1 — .r)ln(l — x), 
where the logarithm is the natural logarithm. 

The popular, uniform k-SAT model generates a formula by 
selecting uniformly and independently m-clauses from the set 
of all 2'^(^) A:-clauses. In this model, the literal degree can 
vary. We are interested in the model where the literal degree 
is constant, which was introduced in [5|. Suppose each literal 
has degree r. Then 2nr = km, which gives a = 2r/k. Hence 
a can only take values from a discrete set of possible values. 
This problem can be circumvented by allowing each literal to 
take two possible values for a degree. However, in this paper 
we consider the case where all the literals have degree r due 



to space restriction. A formula is represented by a bipartite 
graph. The left vertices represent the literals and right vertices 
represent the clauses. A literal is connected to a clause if it 
appears in the clause. There are kan edges coming out from 
all the literals and kan edges coming out from the clauses. We 
assign the labels from the set E = {!,..., kan} to edges on 
both sides of the bipartite graph. In order to generate a formula, 
we generate a random permutation 77 on £ . Now we connect 
an edge / on the literal node side to an edge II{i) on the clause 
node side. This gives rise to a regular random k-SAT formula. 
Note that not all the formulas generated by this procedure 
are simple. However, it was shown in (5] that the threshold 
is same for this collection of formulas and the collection of 
simple formulas. Thus, we can work with the collection of 
configuration formulas generated by this procedure. Note that 
this procedure is similar to the procedure of generating regular 
LDPC codes ifTTI . 

The regular random fc-SAT formulas are of interest because 
such instances are computationally harder than the uniform 
^-SAT instances. This was experimentally observed in (|5], 
where the authors also derived upper and lower bounds on 
the satisfiability threshold for regular random 3-SAT. In |9|, 
upper bound on the satisfiability threshold for any ^ > 3 was 
derived using the first moment method. It was shown that as 
k increases, the upper bound converge to the corresponding 
bounds on the threshold of the uniform model |[T|, JS]. 

For uniform model, in a series of breakthrough papers, 
Achlioptas and Peres in |3| and Achlioptas and Moore in 
[1], derived almost matching lower bounds with the upper 
bound by carefully applying the second moment method to 
balanced satisfying assignments. In [l], based on their belief 
that the simple application of second moment method should 
work for symmetric problems, Achlioptas and Moore posed 
the question of its success for regular random A:-SAT. In 
an attempt to answer this question, the lower bound using 
the second moment method was evaluated in f9\. As the 
evaluation of the lower bound was numerical in nature (though 
exact), it was observed that the lower bound also converges 
to the corresponding lower bound for the uniform model as k 
increases. 

In this work we are interested in the maximum satisfiability 
problem over regular random formulas. We say that a formula 



is /^-satisfying if there is a truth assignment satisfying c{p) ^ 
1 — 2^^^ + pl^'^ fraction of its clauses, p G (0, 1). The number 
of p-satisfying truth assignments is denoted by N{n,a,p). We 
define the following quantities related to /^-satisfiability: 

'^(p) — sup{a : A regular random formula, 

is /:i-satisfiable w.h.p.}, (1) 
a*{p) = infja : A regular random formula, 

is not /?-satisfiable w.h.p.}. (2) 

Note that a{p) < a* {p). In I2j|, for the uniform model Achliop- 
tas, Naor, and Peres derived lower bound on a{p) which 
almost matches with the upper bound on a*{p) derived via 
the first moment method. The lower bound was obtained by a 
careful application of the second moment method. 

We derive upper bound on a*{p) by applying the first 
moment method to N{n,a,p). The obtained upper bound 
matches with the corresponding bound for the uniform model. 
We evaluate a lower bound on a{p) by applying the second 
moment method to N{n,a,p). We observe that for increasing 
k the lower bound seems to converge to the corresponding 
bound for the uniform model. In the next section, we obtain 
upper bound on a*{p) by the first moment method. 

Due to space limitations, some of the arguments are accom- 
panied by short explanations. Further details can be found in 
the forthcoming journal submission lilO I. 

II. Upper Bound on Threshold via First Moment 

Let X be a non-negative integer-valued random variable 
and E{X) be its expectation. Then the first moment method 
gives: P(X > 0) < E{X). Note that by choosing X to be the 
number of solutions of a random formula, we can obtain 
an upper bound on the threshold a*{p) beyond which no 
/^-satisfying solution exists with probability one. This upper 
bound corresponds to the largest value of a at which the 
average number of p-satisfying solutions goes to zero as n 
tends to infinity. In the following lemma, we derive the first 
moment of N{n,a,p) for regular random A:-SAT for k>2. 

Lemma 2.1: Let N{n,a,p) be the number of /^-satisfying 
assignments for a random regular k-SAT formula. TherQ, 



E{N{n,a,p)) - 2" 



c{p)an J {kan)l 

c{p)an 



,(3) 



where s{x) = (1 1 and coef ( ^— 



denotes the coefficient of x 



The probability of the all-zero vector being /j-satisfying is 
given by 

P (X = {0, ... ,0} is /7-satisfying) 
Number of formulas for which X = {0, ... ,0} is /:i-satisfying 
Total number of formulas 

The total number of formulas is given by {kan)\. The total 
number of formulas for which the all-zero assignment is a 
/9-satisfying is given by 

The binomial term corresponds to choosing c{p) fraction 
of clauses being satisfied by the all-zero assignment. The 
factorial terms correspond to permuting the edges among 
true and false literals. Note that there are equal numbers of 
true and false literals. The generating function s{x) corre- 
sponds to placing at least one positive literal in a clause. 
With these results and observing that coef = 

coef ((i(x)/x)'("'"",;c(2-'(''))"") , we obtain ©. ■ 
Remark: The computation of the first moment of N{n,a,p) 

is similar to the computation of the first moment for weight 

distribution of regular LDPC ensembles. There is large body 

of work dealing with first moment of weight distribution. So, 

we refer to liTTI and the references there in. 

We now state the Hayman method to approximate the coef- 

term which is asymptotically correct [11|. 

Lemma 2.2 (Hayman Method): Let q{y) = Y^iliY be a 

polynomial with non-negative coefficients such that qo ^ 

and qi ^ 0. Define 



dq{y) 1 
dy y' 



Then, 



coef(g(y)«,y" 



bq{y) ^y 



q{yu 



da^{y) 
dy 



(4) 



(y„)"V2™N(3'-) 



(l+o(l)), (5) 



where y^ is the unique positive solution of the saddle point 
equation aq{y) = uj. The solution y^ also satisfies 



yc 



= inf 



qiy)" 



(6) 



>'>0 y'^ 

We now use Lemma 12.21 to compute the expectation of the 
total number of /^-satisfying assignments. 

Lemma 2.3: Let N{n,a,p) denote the total number of p- 
satisfying assignments of a regular random k-SAT formula. 



.v(x) 



c{p)an 



in the expansion of Let t{x) = where s{x) is defined in Lemma IZTI Then 



Proof: Due to symmetry of the formula generation, any 
assignment of variables has the same probability of being p- 
satisfying. This implies 

E{N{n,a,p)) ^2"P{X ^{0,...,0} is /^satisfying.) . 

'We assume that kan is an even integer. 



E{Nin,a,p)) = 



„n((l-to)ln(2)) 



i7rc{py{l - c{p))bt{xk)an 



g«(Q/!(c(p))+c(p)Qln(f(jc^))-(^-Qc(p))ln(,rt))|-j (7) 

where Xk is the solution of at{x) = j^-y — I. The quantity at{x) 
and bt{x) are defined according to (|4|i. 



Proof: Using Stirling's approximation for the binomial 
terms (see (TT', p. 513]) and Hayman approximation for the 
coef term from Lemma |5] gives the desired result. ■ 
In the following lemma we derive explicit upper bounds 
on the clause density for the existence of p-satisfying assign- 
ments. 

Lemma 2.4 (Upper bound): Let a*{p) be as defined in (|2|i. 
Define alXp) to be the upper bound on a*{p) obtained by the 
first moment method. Then, 

2*^ln(2) 



a*{p)<al{p) 



(8) 



p + {l-p)\n{l-py 

Proof: Using (|6]l, we obtain the following upper bound 
on the exponent of E{N{n,a, p))) for any x > 0, 



lim 



In {E{N{n,a))) 



< {l-ka)ln{2)+ah{c{p))+ 



c{p)a\nit{x))-al^-~cip)j\nix). (9) 

We substitute x — 1 in (|9]l and obtain the following upper 
bound on the p-satisfiability threshold, 

ln(2) 



a*ip)< 



i) 



kln{2)-h{c{p))~c{p)ln{2'' 

We complete the proof by showing that the denominator in 
([Tol l is lower bounded by 2-'' {p + {I - p)ln{l - p)). This 
can be easily shown by considering their difference and then 
showing its positivity. ■ 
Note that the upper bound coincides with the upper bound 
derived for uniform formulas in ||2l . In the next section we use 
the second moment method to obtain lower bound on a{p) 
defined in (O. 

III. Second Moment 

In 111, almost matching lower bounds on the p-satisfiability 
threshold of uniform formulas were derived using the second 
moment method. The second moment method is governed by 
the following equation 



P{X>0) > 



(11) 



where X is a non-negative random variable. Before we use 
the second moment method, we present the following theorem 
from IS] and its corollary given in ||2l ■ Proof of both theorem 
and the corollary are identical for the uniform model and the 
regular model. 

Theorem 3.1 ( [6]): Let Uk{n,a) be the maximum number 
of clauses satisfied (over all the truth assignments) of a regular 
random formula with n variables and m clauses. Then 

2f2^ 

P{\Uk{n,a)~E{Uk{n,a))\ > t) <2exp ' 



Corollary 3.1 ( [2 J): Assume that there exists c = c{k,p,r) 
such that for n large enough, a regular random formula is 
p-satisfiable with probability greater than n^' . Then a regular 
random formula is p'-satisfiable with high probability for every 
constant p' < p. 



We now apply the second moment method to N{n,a, p). The 
calculation for p=l i.e. for number of satisfying assignments 
was done in [9J. Our computation of the second moment is 
inspired by the computation of the second moment for the 
weight and stopping set distributions of regular LDPC codes 
in 121, fSl (see also [4]). We compute the second moment in 
the next lemma. 

Lemma 3.1: Consider regular random satisfiability formu- 
las with literal degree r. Then the second moment of N{n^ ct,p) 
is given by 



an 
c(p)an 

2 



n c{p)an 

E{N{n,a,pf)^Y^ £ 2" 

'=0;=a«(f(p)-(l-p)2-*) 

^c{p)an\ fan{\ - c{p))\ {{r{n - i))\Y {{ri)\) 
J J V c{p)an-j J {kan)\ 
coef{f{xuX2,X3y{s{xi)s{^,)y<"^""-\ {xiX3Y^"-'h'^) , (12) 
where the generating function f{xi,X2,Xi) is given by 

f{xi,X2,X3) = (1 +Xl +X2 +x^)'' - (1 +xi)'' - (1 +x^)'' +1. 

(13) 

The generating function s{x) is defined as s{x) = (1 +x)* — 1, 
which is same as defined in Lemma ITT] 

Proof: For truth assignments X and Y, define the indicator 
variable ixr which evaluates to 1 if the truth assignments X 
and Y are /7-satisfying. Then, 

E{N{n,a,pY) = ^ £(1xy), 
XJe{QA}" 

= 2" ^ P (0 and Y are p-satisfying) . 

FG{0,1}" 

Due to the symmetry in regular formula generation, the 
number of formulas for which both X and Y are /^-satisfying 
depends only on the number of variables on which X and Y 
agree. This explains the last simplification where we fix X to 
be the all-zero vector. 

We want to evaluate the probability of the event that the 
truth assignments and Y /^-satisfy a randomly chosen regular 
formula. This probability depends only on the overlap, i.e., the 
number of variables where the two truth assignments agree. 
Thus for a given overlap /, we can fix Y to be equal to zero in 
the first / variables and equal to 1 in the remaining variables 
i.e. y = {0,--- ,0,1,--- ,1}. This gives. 



E{Nin,a,p) 



E2« 

1=0 



P (0 and Y are p-satisfying) . 



(14) 

In order to evaluate the probability that both and Y are p- 
satisfying, define C = {I,-- - ,an} to be the set of clauses 
and Co and Cy to be the set of clauses satisfied by and 
Y respectively. Clearly, |Co| — \Cy\ —c{p)an. Then, 

P (0 and Y are p-satisfying) = 

E P(0 only satisfies Q and Y only satisfies Cy) . (15) 

Co.CyCC 



Again from the symmetry of the regular formula generation, 
we fix Co = {I,-- - ,c{p)an}. For ICoHCfI = j, we fix Cy — 
{I,-- - J,c{p)an + l,...,2c{p)an-j}. This gives. 



P(0 and Y are p-satisfying) 



c{p)an 

c{p)an\ / an{l — c{p)) 
j )\c{p)an-i 



c{p)a.n 

L 

i=Q«(c(p)-(l-p)2-*) 

X P(0 only satisfies Co and Y only satisfies Cy) ■ (16) 

Note that j > an{c(p) - {\ - p)!''') as |Co| + \Cy\ - |Co n 
Cy \ < an. For a given overlap / between and Y, we observe 
that there are four different types of edges connecting the 
literals and the clauses. There are r(n — /) type 1 edges which 
are connected to true literals w.r.t. the truth assignment and 
false w.r.t. to the Y truth assignment. The ri type 2 edges 
are connected to true literals w.r.t. both the truth assignments. 
There are r{n — i) type 3 edges which are connected to false 
literals w.r.t. the truth assignment and true literals w.r.t. to 
the Y truth assignment. The ri type 4 edges are connected to 
false literals w.r.t. both the truth assignments. Let f{xi,X2,X3) 
be the generating function counting the number of possible 
edge connections to a clause such that the clause is satisfied 
by both and Y. In f{xi,X2,Xi), the power of x, gives the 
number of edges of type /, / g {1,2,3}. A clause is satisfied 
by both and Y if it is connected to at least one type 2 edge, 
else it is connected to at least one type 1 and at least one type 
3 edge. Then the generating function f{xi,X2,X3) is given as 
in (fT3l ). Using this, we obtain 

P(0 only satisfies Co and Y only satisfies Cy) — 

((r(«-0)!)^((n)!)\ , 
{kan)\ 

coef(/(:.i,;.2,:^3)^'(*(:^i)*(:^3))^'<'''""-',(xiX3)'"("-')4'), (17) 

where s{xi) is the generating function for clauses satisfied 
by and not satisfied by Y (similarly we define s{xt,)). The 
term {kan)l is the total number of formulas. Consider a given 
formula which is satisfied by both truth assignments and 
Y. If we permute the positions of type 1 edges on the clause 
side, we obtain another formula having and Y as solutions. 
The argument holds true for the type / edges, / e {2,3,4}. 
This explains the term (r(n — /))! in (fTTT l which corresponds to 
permuting the type 1 edges (it is squared because of the same 
contribution from type 3 edges). Similarly, {ri)]^ corresponds 
to permuting type 2 and type 4 edges. As jConCyl = j, 
there are j clauses which satisfied by both and Y. This 
explains the factor f{xi,X2,xj)^ in the coef term. There are 
|Co\(CoUCf)|) = \Cy\{CoUCy)\ ^ c{p)an- j clauses which 
are satisfied by (resp. Y) and not satisfied by Y (resp. 0). 
This explains the factor (i(;ii;i)i(;ii;3))'^('''""^-' in the coef term. 
We complete the proof by substituting (1T\ in ( fTSI l, then (fTSI l 
in dull. ■ 
In order to evaluate the second moment, we now present 
the multidimensional saddle point method in the next lemma. 



A detailed technical exposition of the multidimensional saddle 
point method can be found in Appendix D of 1 1 1 1 . 

Theorem 3.2: Let /:= {r{n — i),ri,r{n — i)), x— {xi,X2,xt,), 
and < lim„^oo i/n < 1 . We define 

where f{x) and s{x) are defined in Lemma [3T| We define the 
normalizations rj = i/n and 7 = j/(an). Let f — {11,12,(3) be 
a positive solution of the saddle point equations 

3 



ag{x)-- 



Xi 3ln(g„j(i)) 



= {r{l-rj),rTj,r{l-,j)}. (18) 



n dxi 

Then coef (^g„j{x),x-) can be approximated as , 

gn.jiL) 



coef(g„jU),i-) 



a)iV(27r«)3|B,(r)| 



using the saddle point method for multivariate polynomials, 
where B^{x) is a 3 x 3 matrix whose elements are given by 
Bi j = Xj —Bjj and flg,(x) is the coordinate of ag{x). 

In the following theorem we derive lower bound on the 
p-satisfiability threshold by evaluating the second moment of 
N{n,a,p) with the help of Theorem 13.21 

Theorem 3.3: Consider regular random A:-SAT formulas 
with literal degree r. Let S{i,j) denote the summation 
term in (fT2] i. Define the normalization rj — i/n and 7 = j/ (an). 
If S{n/2,nac{pY) is the dominant term i.e. for 77 G [0, 1],7 G 
[{c{p)-2->^{\-p)),c[p)U^ i, and 7 ^ c{pY 

^^HS{hnac{pf))^^^MSi,n,,an))^ ^^^^ 

then for some positive constants c, c' 

V{N{n,a,p)>Q)>c'n-'. (20) 

Let r*{p) be the largest Uteral degree for which 
S{n/2,nac{py) is the dominant term, i.e. ( fT9] l holds. 
Then the threshold a{p) defined in ^ is lower bounded by 

Proof: Assuming ( fT9l ) holds, then for n large enough 

E{N{n,a,pf) < {n + l){a{l - p)2-''n+l)S (^^,nac{p)' 

(21) 

From ( fT2b and Theorem 13.21 the growth rate of S{rin,'-fan) is 
given by, 

,(,,7) ^ lim '"^^^'^"'^""^^ = (1 -M(ln(2) 

„-j.oo fl 

+ ah{c{p))+ac{p)h[^]+a{l~c{p))h'' ^^^'^^^ 



+ ja In (/(f 1 , f2 , f3 ) ) + a{c{p) - 7) (ln(i (f 1 ) ) + ln(i (fj ) ) ) 

- r ( 1 - ??) (ln(f 1 ) + ln(f3 ) ) - rr? In(f2) , (22) 

where ti,t2,tj is a positive solution of the saddle point equa- 
tions as defined in Theorem 13. 2[ 

(23) 



agit) = {r{l-T]),rT],r{l-r))}. 



In order to compute the maximum exponent of the summation 
terms, we compute its partial derivatives with respect to 77 
and 7 and equate them to zero, which resuh in the following 
respective equations. 



In 



(l-to)ln 
7(l-2c(/,)+7) 



1-,; 



In 



flf3 
t2 

S{ti)s{t3) 



rln — - 0, (24) 



= 0. (25) 



Note that the partial derivatives of fi,f2 and fs w.rt. 77 and 7 
vanish because of the saddle point equations given in ( l23T l. 
Every positive solution (fi,f2,f3) of (|23] l satisfies ti = as 
(|23] l and f{ti,t2,to,) are symmetric in ti and to,. If 77 = 1 /2,7 = 
c{p)^ is a maximum, then the vanishing derivative in ( l24b and 
equality of ti and tj, imply t2 — t\. We substitute 77 = 1/2, 
ti = f3, and f2 = in (l23l l. This reduces ( |23] | to the saddle 
point equation corresponding to the polynomial t{x) defined 
in Lemma |23] whose solution is denote by x^. By observing 
f{xk,xl,Xk) = s{xk)^, we have 



S {^,ac{pfn 



32TTMpY{\-c{p)Y^\B,{xk,xlxk)\n^ 

^2n{(l-ka)\n(2)+ah(c(p))+ac(p)Hs{xt))-'^\n(xt))^Y +o(l)). (26) 

Using the relation that t{x) = we note that the exponent 
of S{n/2,ac{p)^n) is twice the exponent of the first moment 
of the total number of solutions as given in We substitute 
( |26] l in to ( |2TI ). Then we use Lemma [23] and (|2T] ) in the second 
moment method: 



P{N{n,a,p) >0) > 



E{N{n,a,pY 
E{N{n,a,pY 



> 



J\Bg{xk,xl,Xk)\ 



bt{xk)a^Vkn 

Clearly, if the maximum of the growth rate of S{rin,^an) is not 
achieved at 77 = 1 /2 and 7 = c{p)'^ , then the lower bound given 
by the second moment method converges to zero exponentially 
fast. By using Corollarv l3.1| we obtain the desired lower bound 
on a{p). This proves the theorem. ■ 
In the next section we discuss the obtained lower and upper 
bounds on the ^-satisfiability threshold. 

IV. Bounds on Threshold and Discussion 

In Table U we compute the ratio of aj{p) and a*i{p) for 
p = O.l,--- ,0.9 and k — 3,6,12, where a*{p) is the lower 
bound on a{p) obtained from Theorem 13.31 and al{p) is the 
upper bound on a*{p) defined in Lemma |Z41 Note that the 
case p — 1 was already solved in ||9l. In order to apply the 
second moment method, we have to verify that 5(77,7), defined 



in 



attains its maximum at rj = = c{p) over [0, 1 



[c{p) — (1 ~ p)2^'^,c{p)]. This requires that 77 = JtI — c{p)^ is 
the only positive solution of the system of equations consisting 
of ( l23T l, ( |24] | and ( l25T l which corresponds to a maximum. The 
system of equations (l23T l, ( |24] |. and (1251 ) is equivalent to a 



p 


k = 3 


k = (} 


k = \2 


0.1 


0.252 


0.717 


0.977 


0.2 


0.258 


0.720 


0.979 


0.3 


0.272 


0.738 


0.980 


0.4 


0.281 


0.755 


0.981 


0.5 


0.295 


0.765 


0.983 


0.6 


0.308 


0.782 


0.986 


0.7 


0.325 


0.801 


0.988 


0.8 


0.344 


0.822 


0.990 


0.9 


0.402 


0.855 


0.993 



TABLE I: Value of the ratio al{p)/a*{p). 



system of polynomial equations. For small values of k and p 
close to one, we can solve this system of polynomial equations 
and verify the desired conditions. For larger values of k, the 
degree of monomials in ( l24l i grows exponentially in k. Thus, 
solving (|23] ). (l24l) . and ( |25] ) becomes computationally difficult. 
However, 5(7^,7) can be easily computed as its computation 
requires solving only ( |23] |. where the maximum monomial 
degree is linear in k. Thus, the desired condition for maximum 
of 5(77,7) at 77 = 5,7 = c{p)^ can be verified numerically in 
an efficient manner 

From the Table H] we see that as k becomes larger the ratio 
al (p) / ct*,{p) gsts closer to one. Our belief is that indeed as 
k becomes larger and larger, the ratio a1{p)/al{p) converges 
to one. Our main future goal is to derive explicit expression 
for ct^ip) as k becomes larger. 
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